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[57] ABSTRACT 

Language translation is accomplished by representing 
natural-language sentences in accordance with a constrained 
grammar and vocabulary structured to permit direct substi- 
tution of linguistic units in one language for corresponding 
linguistic units in another language. Preferably, the vocabu- 
lary is represented in a series of physically or logically 
distinct databases, each containing entries representing a 
form class as defined in the grammar. Translation involves 
direct lookup between the entries of a reference sentence and 
the corresponding entries in one or more target languages. 

22 Claims, 2 Drawing Sheets 
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METHOD AND APPARATUS FOR 
AUTOMATED LANGUAGE TRANSLATION 

FIELD OF THE INVENTION 

The present invention relates generally to automated 
language translation, and in particular to a system for 
translating constrained linguistic constructs assembled in 
accordance with an exact grammar. 

BACKGROUND OF THE INVENTION 10 

Since the time when improvements in transportation 
began to significantly reduce the inconvenience and cost of 
cross-border travel, the desirability of universal communi- 
cation has been recognized. In the 1960s, for example, 15 
international efforts were made to promote Esperanto as a 
universal language. While that effort ultimately failed, the 
large number of fluent speakers — between 1 and 15 million 
worldwide — and the scope of the efforts illustrate the prob- 
lem's importance. Esperanto did not succeed because it 20 
required acquisition of both a new grammar and a new 
vocabulary, the latter posing a far greater challenge for 
would-be speakers. 

The improving ease and speed with which information 
can now be transmitted worldwide has augmented the need 25 
for universal communication. Current efforts have focused 
most heavily on automated translation among languages. 
Systems now in use generally store, in a source and a target 
language, millions of frequently used words, phrases and 
combinations, relying for accuracy and robustness on the 30 
occurrences in the text to be translated. Such systems are by 
definition incomplete, since no system can possibly store 
every possible word combination, and their usefulness var- 
ies with the linguistic idiosyncracies of their designers and 
users. It is almost always necessary for a human to check and 35 
modify the output translation. These systems also translate 
one word at a time (and so operate slowly), and require a 
separate database unique to each target language. Moreover, 
because they are programmed to recognize distinctive lan- 
guage characteristics and their unique mappings from one 40 
language to another, each translation must be done individu- 
ally. In other words, the time required for multiple transla- 
tions is the sum of the times for each translation performed 
individually. 

Translation is difficult for numerous reasons, including 45 
the lack of one-to-one word correspondences among 
languages, the existence in every language of homonyms, 
and the fact that natural grammars are idiosyncratic; they do 
not conform to an exact set of rules that would facilitate 
direct, word-to-word substitution. It is toward a computa- 
tional "understanding" of these idiosyncracies that many 
artificial-intelligence research efforts have been directed, 
and their limited success testifies to the complexity of the 
problem. 55 

SUMMARY OF THE INVENTION 

The present invention provides an artificial grammar for 
expressing the thoughts and information ordinarily con- 
veyed in a natural grammar, but in a structured format 60 
amenable to automated translation. Sentences in accordance 
with the invention are constructed based on a fixed series of 
rules applied to an organized natural vocabulary. The gram- 
mar is clear in the sense of being easily understood by native 
speakers of the vocabulary and complex in its ability to 65 
express sophisticated concepts, but, because sentences are 
derived from an organized vocabulary according to fixed 
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rules, they can be readily translated from one language to 
another. Preferably, the vocabulary is represented in a series 
of physically or logically distinct databases, each containing 
entries representing a form class as defined in the grammar. 
Translation involves direct lookup between the entries of a 
reference sentence and the corresponding entries in one or 
more target languages. 

Unlike natural languages, the invention employs a finite 
(although flexible and extensible) lexicon, an exact set of 
form classes and a finite and exact set of rules for sentence 
formation. Starting with a term from one of four form 
classes, sentences can be constructed by iterative application 
of four expansion rules that govern the manner in which 
terms from the various classes can be combined. The result- 
ing "sentences," while unlike those of the natural language 
they are intended to represent, can nonetheless accurately 
and comprehensibly represent the full range of meanings of 
natural-language sentences. The invention exploits the rela- 
tive ease of learning a new grammar, particularly one that is 
highly constrained to a few precise rules, as compared with 
learning a new vocabulary. As a result, after becoming 
familiar with this grammar, the user can easily compose 
sentences in the manner prescribed by the present invention. 

Accordingly, to use the invention, a natural-language 
sentence is translated or decomposed into the (typically) 
simpler grammar of the invention but preserving the original 
vocabulary. Although it is possible to accomplish this with 
some degree of automation, the full benefits of the invention 
are most directly realized by manual development — either 
by primary translation or direct composition — in the inven- 
tion's grammar, which is easily learned and applied. And 
because translation involves simple substition of equivalent 
entries from different languages, multiple-language transla- 
tion of an input sentence is achieved almost instantly. 

The translated output is as easily understood by a native 
speaker of the target language as the input was to the author 
of the original text. Thus, it is possible to carry out "con- 
versations" in the grammar of the invention by formulating 
statements according to the grammar, passing these to an 
interlocutor for translation and response, and translating the 
interlocutor's responses. For example, a business person 
native to the United States and without knowledge of 
German can conduct a meeting with native German speakers 
using as a translation device a laptop computer configured in 
accordance with the invention, exchanging thoughts via the 
computer. Indeed, the same thoughts may be simultaneously 
broadcast to multiple interlocutors each speaking a different 
language, with their individual responses simultaneously 
and multiply translated as well. Correspondents can 
exchange messages by e-mail, in their native languages, 
simply by formulating the messages in accordance with the 
invention's grammar; recipients speaking different lan- 
guages who have e-mail systems implementing the inven- 
tion receive the message translated into their native 
languages, and their responses are automatically translated 
into the original sender's language upon their arrival; in this 
way, each correspondent is exposed only his or her native 
language. 

The invention is advantageously employed even in situ- 
ations demanding an ultimate output in a natural language, 
since translation to this format is readily accomplished. For 
example, a news reporter might file a story worded in the 
invention's grammar for dissemination to numerous bureaus 
serving different national audiences. The story is instantly 
translated into the appropriate languages upon arrival at the 
different bureaus, where it may then be further refined into 
a form suitable for communication to the audience. The 
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skills required if further translation is desired are essentially case, each definition of the word represents a different 

editorial in nature, and thus require less specialized training linguistic unit, the various definitions may appear as entries 

than would be necessary, for example, for true language in different form classes. For purposes of automation, each 

translation; indeed, communications media already employ definition is distinguished, for example, by the number of 

personnel to carry out the similar tasks of editing and 5 periods appearing at the end of the word. The entry for the 

revising raw news material taken from wire services. first (arbitrarily designated) definition is listed with no 

In accordance with the invention, sentences are composed^ period, the entry representing the second definition is listed 

r of "linguistic units," each of which may_be one -or a.fewj^ with one period at its end, and so on. Alternatively, different 

words, from the allowed form classes. These classes are word senses can be identified numerically, e.g., using sub- 

^ "things" or nominal terms that connote, for example, people/ 10 scripts. 
■ places, items, activities or ideas^connectors" that specify ^ Words unique to the invention may make up a very small 

^relationships between two (or more) nominal terms/ proportion of the total lexicon, and none of these words is 

"descriptors" modifying the state of one or more nominal y specific to the invention or alien to the natural language upon 

^terms; and 'logical connectors" establishing sets of the 7 which it is based. Instead, invention-specific words are 

nominal terms. The list of^ all allowed entries in all four ^ broadened in connotation to limit the overall number of 

classes represents the global lexicon of the invention. To' terms in the lexicon. For example, in a preferred 

construct a sentence in accordance with the invention, implementation, the word "use" is broadened to connote 

entries from the form classes are combined according to four employment of any object for its primary intended purpose, 

expansion rules detailed below. These rules can be followed ^so that in the sentence "Jake use book," the term connotes 

explicitly in a stepwise fashion to produce sentences, but 20 reading. The word "on" may be used to connote time (e.g., 

more typically, once me user is accustomed to the grammar, (i go-to ballegame) on yesterday). If desired for ease of use, 

sentences are constructed by "feel" and, if necessary, sub- however, the invention-specific words can be eliminated 

sequently tested for conformity with the expansion rules. altogether and the lexicon expanded accordingly. 

Thus, the invention solves the three obstacles noted The invention divides the global lexicon of allowed terms 
earlier that have prevented the emergence of truly robust 25 into four classes: "things" or nominal terrns thajxonnote, for 
translation systems. The idiosyncratic nature of different example, people, places, iterhs, activities or ideas7identified 
grammars is overcome by substituting a fixed grammar, and herein by the code T; ^connectors" that specify relationships 
the one-to-one correspondence problem is addressed between two (or more) nominal terms (including words 
through a specialized and finite database. Homonyms are typically described as prepositions and conjunctions, and 
handled by explicitly labeling the different senses of a word 30 terms describing relationships in terms of action, being, or 
and requiring explicit selection of the intended sense. These states of being), identified herein by C; ".descriptors'' modi- 
capabilities allow the invention to be conveniently and fying the state of _one_ or more nominal terms' (including 
inexpensively applied to many languages, even exotic ones; words typically described as adjectives, adverbs and intran/ 
current systems, by contrast, are directed almost exclusively / sitive verbs)7 identified herein by D; and~"16gical conriec- 
to the major languages due to the expense inherent in their 35 tors" establishing sets of the nominal terms, identified herein 
design. by £. Preferred lists of nominal terms, connectors and 
A representative hardware implementation includes a 7 descriptors are set forth in Appendices 1-3, respectively. 
r series of logically or physically distinct electronic databases/ Th e preferred logical connectors are "and" and "or." 
L j in which the vocabulary is stored, a computer memory ^ Naturally, the lexicon cannot and does not contain a list of 
L partition for accepting an input in a reference language and/ possible proper names; instead, proper names, like other 
c structured in accordance with the invention; and analysis / words not recognized by the invention, are returned inside 
means (generally arprocessor operated in accordance with ' angle brackets to indicate that translation did not occur. The 
stored-computer -instructions) for (i) addressing the data- system also does not recognize verb tenses; connectors are 
bases with the input to retrieve entries in the target language 45 phrased in the present tense, since tense is easily understood 
- corresponding thereto, and : (ii) translating the sentence by / from context. Tense may nonetheless be indicated, however, 
replacing the input with the identified target entries. ; " by specifying a time, day and/or date. 

* ' BRIEF INSCRIPTION OF THF ORAWTNfiS Sentences in accordance with the invention are con- 

BRIfc* DESCRIPTION OF THE DRAWINGS stmcted from lerms - m me kxicon according t0 four expan . 

The invention description below refers to the accompa- 50 sion rules. The most basic sentences proceed from one of the 
nying drawings, of which: following three constructions (any of which can be created 
FIG. 1 schematically illustrates application of the expan- " T ! e ™ fa , ac ?° rdanc f ^ *» ^Pfsion rules set 
sion rules of the present invention; and fortl J 1 «»iBinbdow). These structures, which represent the 
„_ „ . , . . , . . smallest possible sets of words considered to carry 
FIG. 2 is a schematic representation of a hardware system 5J Motion, are lhe building blocks of more complex sen- 
embodying the invention. tences -j^ structural simplicity facilitates ready transla- 
DETAILED DESCRIPTION OF AN ^ convelsational . patural-language sentences; thus, 
ILLUSTRATIVE EMBODIMENT eVe ° C °.? lpl , eX f^**.™ accordance with the invenUon 

are easily transformed into natural-language equivalents 
The system of the present invention makes use of a 60 through modular analysis of the more basic sentence corn- 
lexicon and a constrained set of grammar rules. The lexicon ponents (a process facilitated by the preferred representa- 
comprises linguistic units divided into four classes. Each lions described later). 

linguistic unit is (1) a single word, such as "dog" or Basic Structure 1 (BS1) is formed by placing a descriptor 

"government"; or (2) a combination of words, such as after a nominal term to form the structure TD. BS1 sentences 

"parking space" or "prime minister"; or (3) a proper name; 65 such as "dog brown" and "Bill swim" readily translate into 

or (4) a word with a definition unique to the invention; or (5) the English sentence "the dog is brown" (or the phrase "the 

one form of a word with multiple meanings. In the latter brown dog") and "Bill swims." 
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BS2 is formed by placing a connector between two 
nominal terms to form the structure TCT. BS2 sentences 
such as c £ctogeat food" readily translate into English equiva- 
lents. " " ' ~ 

BS3 is formed by placing a logical connector between two 
nominal terms to form a series represented by the structure 
TCT . . . The series can be a single conjunction, such as "Bob 
and Ted," or compound structure such as "Bob and Ted and 
Al and Jill" or "red or blue or green." 

A sentence comprising one or more of the basic structures 
set forth above may be expanded using the following rules: 
Rule I: To a nominal term, add a descriptor (T-*TD) 

In accordance with Rule I, any linguistic unit from the 
nominal class can be expanded into the original item fol- 
lowed by a new item from the descriptor class, which 
modifies the original item. For example, "dog" becomes 
"dog big." Like all rules of the invention, Rule I is not 
limited in its application to an isolated nominal term 
(although this is how BS1 sentences are formed); instead, it 
can be applied to any nominal term regardless of location 
within a larger sentence. Thus, in accordance with Rule I, 
TDi-^TDJDj. For example, "dog big" becomes "(dog 
brown) big" (corresponding to English sentence, "the brown 
dog is big"). 

The order of addition may or may not be important in the 
case of consecutive adjectives, since these independently 
modify T; for example, in "(dog big) brown," the adjective 
"big" distinguishes this dog from other dogs, and "brown" 
may describe a feature thought to be otherwise unknown to 
the listener. The order of addition is almost always important 
where a D term is an intransitive verb. For example, expand- 
ing the TD sentence "dog run" (corresponding to "the dog 
runs" or "the running dog") by addition of the descriptor 
"fast" forms, in accordance with Rule I, "(dog fast) run" 
(corresponding to "the fast dog runs"). To express "the dog 
runs fast," it is necessary to expand the TD sentence "dog 
fast" with the descriptor "run" in the form "(dog run) fast." 

Applying expansion Rule I to the structure BS2 produces 
TCT— (TD)CT. For example, "dog eat food" becomes "(dog 
big) eat food." Rule I can also be applied to compound 
nominal terms of the form TCT, so that a structure of form 
BS3 becomes T£T—(TCT)D. For example, "mother and 
father" becomes "(mother and father) drive." In this way, 
multiple nominal terms can be combined, either conjunc- 
tively or alternatively, for purposes of modification. It 
should also be noted that verbs having transitive senses, such 
as "drive "are included in the database as connectors as well 
as descriptors. Another example is the verb "capsize," which 
can be intransitive ("boat capsize") as well as transitive 
("captain capsize boat"). 

Rule Ila: To a nominal term, add a connector and another 
nominal term (T— »TCT). 

In accordance with Rule Ha, any linguistic unit from the 
nominal class can be replaced with a connector surrounded 
by two nominal entries, one of which is the original linguis- 
tic unit. For example, "house" becomes "house on hill." 
Applying expansion Rule \\a to BS1 produces TO-+(TCT) 
D; for example, "gloomy house" becomes "(house on hill) 
gloomy," or "the house on the hill is gloomy." 

Rule \\a can be used to add a transitive verb and its object. 
For example, the compound term "mother and father" can be 
expanded to "(mother and father) drive car." 
Rule lib: To a nominal term, add a logical connector and 
another nominal term (T-»TCT). 

In accordance with Rule 116, any linguistic unit from the 
nominal class can be replaced with a connector surrounded 
by two nominal entries, one of which is the original linguis- 
tic unit. For example, "dog" becomes "dog and cat." 
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Again, for purposes of Rule Ila and Rule Ub, a nominal 
term can be a composite consisting of two or more nominal 
terms joined by a connector. For example, the expansion 
"(john and bill) go-to market" satisfies Rule Ila. Subse- 
5 quently applying Rule I, this sentence can be further 
expanded to "((john and bill) go-to market) together. 
Rule III: To a descriptor, add a logical connector and another 
descriptor (D-»D£D). 
In accordance with Rule III, a descriptor can be replaced 

10 with a logical connector surrounded by two descriptors, one 
of which is the original. For example, "big" becomes "big 
and brown." Applying expansion Rule in to BS1 produces 
TD-*T(D£D); for example "dog big" (equivalent to "the 
dog is big," or "the big dog") becomes "dog (big and 

15 brown)" (equivalent to "the dog is big and brown" or "the 
big brown dog"). 

The manner in which these rules are applied to form 
acceptable sentences in accordance with the invention is 
shown in FIG. 1. Beginning with a nominal term such as cat, 

20 shown at 110, any of the three basic structures can be formed 
by following expansion Rules I, Ila and lib as shown at 112, 
114, 116, respectively, to produce "cat striped" (BS1), "cat 
on couch" (BS2) or "cat and Sue" (BS3). Iterative applica- 
tion of expansion rule Ila at 118 and 119 produces structures 

25 of the forms TQTj-^CTQTJCjTj or "((cat on couch) eat 
mouse)" and (TCJ 1 )CJ , 2 ^((TC 1 T 1 )C 2 T^C 3 T 3 or "(((cat 
on couch) eat mouse) with tail)." Expansion rule I can be 
applied at any point to a T linguistic unit as shown at 122 (to 
modify the original T, cat, to produce "(happy cat) on 

30 couch") and 124 (to modify "eat mouse"). Rule III can also 
be applied as shown at 126 (to further modify cat to produce 
"(((happy and striped) cat) on couch)") and 128 (to further 
modify "eat mouse"). 
Expansion Rule I can be applied iterative ly as shown at 

35 112, 130 to further modify the original T (although, as 
emphasized at 130, a descriptor need not be an adjective). 
Expansion Rule Ila is available to show action of the 
modified T (as shown at 132), and Rule I can be used to 
modify the newly introduced T (as shown at 134). Rule I can 
also be used to modify (in the broad sense of the invention) 
a compound subject formed by Rule Hfc, as shown at 136. 

The order in which linguistic units are assembled can 
strongly affect meaning. For example, the expansion 
TCjTj-KTCjT^Q^ can take multiple forms. The con- 

45 struct "cat hit (ball on couch)" conveys a meaning different 
from "cat hit ball (on couch)." In the former the ball is 
definitely on the couch, and in the latter the action is taking 
place on the couch. The sentence "(john want car) fast" 
indicates that the action should be accomplished quickly, 

50 while "(john want (car fast))" means that the car should 
move quickly. 

A more elaborate example of the foregoing expansion 
rules, which illustrates the utility of the invention in repre- 
senting a natural-language discussion, appears in the fol- 
55 lowing table: 

TABLE 1 

Zairian health officials said 97 people have died from the Ebola vims so 
6Q far. Jean Tarn fun, a virologist, who helped identify the virus in 1976, 

criticized the government's quarantines and roadblocks as ineffective. On 

Saturday the quarantine on the Kikwith region was offically lifted. 

heaUh-official/s of zaire 

•say* 

people 97 

'dead 
65 •because-of 

virus named ebola 
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TABLE 1 -continued 



jean -Lam fun be" 
virologist in zaire 
he help- 

scientist/s identify* 
virus named ebola 
•in 1976 

jcan-tamfun criticize* 
government of zaire 
he say* 

quarantine/s ineffective 
•and* 

roadblock/s ineffective 
government end* 
quarantine of* 
region named kikwit 
•on Saturday 



database 235 2 , an exemplary column of which appears in 
Table 3; descriptors are contained in database 235 3 , an 
exemplary column of which appears in Table 4; and logical 
connectors (most simply, "and" and "or") are contained in 
database 235 4 . 

TABLE 2 

NOMINATIVE TERMS 
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A representative hardware implementation of the inven- 
tion is shown in FIG. 2. As indicated therein, the system 
includes a main bidirectional bus 200, over which all system 
components communicate. The main sequence of instruc- 
tions effectuating the invention, as well as the databases 
discussed below, reside on a mass storage device (such as a 
hard disk or optical storage unit) 202 as well as in a main 
system memory 204 during operation. Execution of these 
instructions and effectuation of the functions of the inven- 
tion is accomplished by a central-processing unit ("CPU") 
206. 

The user interacts with the system using a keyboard 210 
and a position-sensing device (e.g., a mouse) 212. The 
output of either device can be used to designate information 
or select particular areas of a screen display 214 to direct 
functions to be performed by the system. 

The main memory 204 contains a group of modules that 
control the operation of CPU 206 and its interaction with the 
other hardware components. An operating system 220 
directs the execution of low-level, basic system functions 
such as memory allocation, file management and operation 
of mass storage devices 202. At a higher level, an analysis 
module 225, implemented as a series of stored instructions, 
directs execution of the primary functions performed by the 
invention, as discussed below; and instructions defining a 
user interface 230 allow straightforward interaction over 
screen display 214. User interface~230 generates words or 
grapMcarimTges~on~display'2i4~t6 prompt action by the' 
^userfand accepts user commands from keyboard 210 and/or 
position-sensing device 212. 

Main memory 204 also includes ^partition defining a 
series of databases capable of storing the linguistic units of 



15 



20 



25 



30 



actor 
address 
advertise- 
ment 
advice 
africa 
afternoon 
age 
aim 
air 

airplane 

airport 

algeria 

altitude 

aluminum 

ambassador 

amount 

animal 

ankle 

answer 

ant 

apartment 

appetite 

apple 

appointment 



35 



apricot 
april 
acchitect 
Argentina 
cheek 
cheese 
chemistry 
cherry 
chess 
chest 
chicken 
40 child 
chile 
chin 
china 
chocolate 
' christmas 
church 
cigar 
cigarette 
circle 
citizen 
clock 



45 



,the-invention, and ^jepresentatively denoted by reference sso cIol &ing 
^numerals 235 lf 235 2 , 235 3 , 235 4 . These databases 235, , 
./which may be physically distinctive., stored in different J 
^memor^parlitions and as separate files on storage device y 
,202) or logically distinct (i.e., stored in a single memory' , 
^- partition as a structured list that may be addressed as a^55 
f plurality of databases), jeacj^contain all of the linguistic units y 
^corresponding to a particular class in at least two languages. ^ 
< In other words, each database is organized as a table each of 
whose columns lists all of the linguistic units of the par- 
ticular class in a single language, so that each row contains 60 
the same linguistic unit expressed in the different languages 
the system is capable of translating. In the illustrated 
implementation, nominal terms are contained in database 
235 j, and a representative example of the contents of that 
database in a single language (English) — that is, the contents 65 
of one column in what would be a multi-column working 
database — appears in Table 2; connectors are contained in 



cloud 
clove 
club 
coal 
coat 

cockroach 
cocoa 
coffee 
collar 
Colombia 
color 
comb 
comfort 
competition 
computer 
concert 
condition 
connection 
conversation 



argument 

arm 

army 

arrival 

ait 

artist 

asia 

attic 

august 

aunt 

austral ia 

austria 

author 

authority 

avalanche 

baby 

back 

backpack 

bag 

baker 

balcony 

ball 

banana 

bandage 

bank 

barley 

bam 

barrel 

basket 

bath 

copy 

corkscrew 

com 

cost 

cotton 

couch 

country 

courage 

cousin 

cow 

cracker 

crane 

cream 

crib 

crime 

cuba 

cucumber 

cup 

curtain 

czechoslov 

akia 

damage 

dance eel 

danger 

date 

daughter 
day 
death 
debt 

december 

decision 

degree 

den mark 

dentist 

departure 

desert 

dessert 

diarrhea 

dictionary 

digestion 



bathrobe 

bathtub 

battery 

beach 

bean 

bear 

beard 

bed 

bedroom 

bee 

beef 

beer 

beet 

beginning 

behavior 

belgium 

bell 

belt 

benefit 

beverage 

bicycle 

bill 

billiard 

bird 

birth 

birthday 

bladder 

blanket 

blood 

blouse 

dinner 

direction 

disease 

dish 

distance 

document 

dog 

donkey 

door 

drawing 

dream 

dress 

driver 

drum 

duck 

dust 

eagle 

ear 

earring 

earthquake 

ecuador 

education 

flute 

egg 

egypt 

elbow 

electricity 

elevator 

end 

enemy 

energy 

engine 

engineer 

england 

entrance 

envelope 

ethiopia 

europe 

excuse 

exhibition 



boat 

body 

boltvia 

bomb 

bone 

book 

border 

bottle 

bottom 

bowl 

box 

boy 

bracelet 

brain 

brake 

brass 

brazil 

bread 

breakfast 

breath 

brick 

bridge 

broom 

brother 

brush 

building 

bulgaria 

bullet 

bus 

butcher 

export 

eye 

face 

factory 

fall 

family 

farm 

father 

february 

ferry 

fig 

finger 

fingernail 

fin land 

fire 

fish 

fist 

flea 

flood 

floor 

flour 

flower 

hail 

fly 

food 

foot 

football 

forest 

fork 

fox 

france 

friday 

friend 

frog 

front 

fruit 

funeral 

game 

garden 

garlic 



butter 

butterfly 

button 

cabbage 

cabin 

cafe 

cake 

camel 

camera 

camp 

Canada 

canal 

candle 

cane 

capital 

captain 

car 

cardboard 

cargo 

carpenter 

carpet 

carrot 

cash 

cat 

cattle 

cauliflower 

cellar 

cemetery 

chain 

chair 

gennany 

gift 

girl 

glass 

glasses 

glove 

glue 

goat 

god 

gold 

goose 

government 

grape 

grapefruit 

grass 

greece 

group 

guard 

guest 

guide 

gun 

gymnastics 
hair 

hairdresser 
half 

hammer 
hand 

handkerchief 

harbor 

harvest 

hat 

he 

head 

health 

heart 

heel 

here 

highway 
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TABLE 2-contimied TABLE 2-continued 



NOMINATIVE TERMS NOMINATIVE TERMS 



cook 


dining- 


exit 


gasoline 


hole 




restaurant 


shoe 


steak 


throat 


vaccination 


copper 


room 


expense 


gauge 


holiday 




result 


shoulder 


steel 


thumb 


vegetable 


bolikod 


key 


luggage 


movie 


pain 




rice 


side 


stick 


thunder 


velvet 


honey 


kidney 


lunch 


mushroom 


painting 
pair 




ring 


signature 


stock- 


thursday 


Venezuela 


horse 


kind 


lung 


mustard 




risk 


silk 


market 


ticket 


victim 


horse-race 


king 


machine 


nail 


pakistan 




river 


silver 


stomach 


tie 


view 


hospital 


kitchen 


magazine 


nail-file 


pancake 


10 


rocket 


sister 


stone 


tiger 


village 


hotel 


knee 


magic 


name 


panic 




roll 


situation 


store 


time 


vinegar 


hour 


knife 


maid 


nature 


pants 




roof 


size 


storm 


timetable 


violin 


house 


kuwait 


mail 


neck 


paper 




voice 


water 


weight 


window 


work 


hungary 


lace 


malaysia 


necklace 


parachute 




waiter 


we 


wheat 


winter 


year 


husband 


ladder 


malta 


needle 


parents 




wall 


weather 


where? 


woman 


yesterday 


I 


lake 


man 


neighbor 


parking 


15 


war 


wedding 


who? 


wood 


you 


ice 


lamb 


map 


nepal 


part 




waste 


Wednesday 


wife 


wool 


yugoslavia 


ice-cream 


language 


march 


netherlands 


partridge 




watch 


week 


wind 


word 


iceland 


lawyer 


market 


new- 


passport 






idea 


lead 


marriage 


zealand 


pea 














import 


leaf 


match 


newspaper 


peace 














india 


leather 


mattress 


nicaragua 


pear 


m 












Indonesia 


lebanon 


may 


nigeria 


peasant 


ZU 






TABLE 3 




information 


kg 


meat 


night 


pen 














ink 


lemon 


medicine 


noodle 


pencil 








CONNECTORS 




insect 


letter 


meeting 


noon 


people 






call 


from 


mix 




insurance 


Liberia 


melon 


north- 


pepper 




able- to 


shoot 


interpreter 


library 


member 


america 


persia 


25 


about 


called 


from 


more-than 


should 


invention 


Hbya 


memorial 


north-pole 


peru 


above 


capsize 


fry 


move 


sing 


Iran 


license 


metal 


norway 
nose 


pharmacy 




across 


capture 


give 


near 


smell 


iraq 


life 


mexioo 


Philippines 




afraid-of 


carry 


go-in 


need 


speak 


ireland 


light 


middle 


november 


physician 




after 


catch 


go- through 


occupy 


steal 


iron 


light-bulb 


milk 


number 


piano 




against 


cause 


go-to 


of 


sting 


island 


lightning 


minute 


nurse 


picture 




allow 


change 


hang 


on 


stop 


israel 


lime 


mistake 


nut 


pig 


30 


answer 


climb 


hate 


outside 


study 


a 


linen 


monday 


oak 


pigeon 




arrest 


close 


have 


pay 


take 


italy 


lion 


money 


oar 


pillow 




arrive-at 


cook 


hear 


play 


teach 


January 


lip 


monkey 


oats 


pilot 




ask 


count 


help 


prepare 


throw 


japan 


liquid 


month 


October 


pin 




at 


cut 


hit 


print 


to 


jewel 


liver 


moon 


office 


pine-tree 




bake 


deal-with 


hunt 


promise 


touch 


job 


living-room 


morning 


oil 


pipe 


35 


be 


decrease 


if 


prove 


translate 


joke 


lobster 


morocco 


olive 


plant 


because 


defeat 


in 


pull 


try 


jordan 


lock 


mosquito 


onion 


platform 




become 


deliver 


tn-front-of 


push 


turn-off 


juice 


look 


mother 


orange 


play 




before 


discuss 


tn-order-to 


put 


turn-on 


july 


loom 


mountain 


ore 


playing- 




begin 


down 


include 


read 


under 


june 


love 


mouse 


ox 


card 




behind 


drink 


increase 


reduce 


understand 


kenya 


luck 


mouth 


package 


pleasure 




bel ieve 


drive 


kill 


refuse 


until 


plum 


room 


skin 


story 


tin 


40 


bet 


drop 


lriss 


remember 


use 


pocket 


root 


skis 


stove 


tire 




betray 


eat 


know 


repeat 


value 


poison 


rope 


sky 


street 


toast 




between 


examine 


learn 


ride 


visit 


poland 


rubber 


sled 


student 


tobacco 




blame 


explain 


leave 


roast 


want 


police- 


ru mania 


smell 


subway 


today 




bother 


find 


like 


say 


wash 


officer 


russia 


smoke 


sugar 


toe 




break 


finish 


live- in 


sec 


while 


porter 


rust 


snake 


summer 


toilet 


45 


bring 


fix 


look- for 


sell 


win 


portual 


saddle 


snow 


sun 


tomato 




burn 


for 


made-of 


send 


with 


post-office 


saddness 


soap 


sunday 


tomorrow 




but 


for 


make 


sew 


work-for 


postcard 


safely 


socks 


surprise 


tongue 




buy 


forget 


meet 


shave 


write 


pot 


saftey-belt 


soda 


swamp 


tool 














potato 


sailor 


soldier 


sweden 


tooth 














powder 


salt 


solution 


Switzerland 


toothbrush 


50 












prison 


sand 


son 


syria 


top 








TABLE 4 




problem 


Saturday 


song 


table 


towel 














property 


sauce 


sound 


tail 


town 








DESCRIPTORS 




purse 


saudi- 


soup 


tailor 


toy 














quarter 


arabia 


south-africa 


taste 


train 




abroad 


dean 


flat 


long 


round 


queen 


squsage 


south- 


tax 


tree 


55 


absent 


dear 


fly 


malignant 


run 


question 


scale 


america 


tea 


trip 




again 


cold 


forbidden 


maybe 


sad 


rabbit 


scarf 


south-pole 


teacher 


trouble 




agree 


complain 


foreign 


mean 


safe 


radio 


school 


soviet- 


telephone 


truth 




alive 


continue 


fragile 


more 


short 


rag 


science 


union 


television 


tuesday 




all 


correct 


free 


much 


sick 


rain 


scissors 


space 


tent 


tunisia 




almost 


cough 


fresh 


mute 


similar 


raincoat 


Scotland 


spain 


test 


turkey 


60 


alone 


crazy 


fun 


mutual 


sit 


rat 


screw 


spice 


thailand 


tv-show 




also 


cry 


funny 


my 


sleep 


razor 


sea 


spoon 


theater 


typewriter 




always 


curious 


glad 


nervous 


slow 


receipt 


self 


spring 


they 


umbrella 




angry 


damp 


good 


neutral 


slowly 


record- 


September 


staircase 


thief 


uncle 




another 


dangerous 


goodbye 


never 


small 


player 


shape 


stamp 


thigh 


united- 




any 


dark 


green 


new 


smile 


refrigerator 


she 


star 


thing 


states 


65 


argue 


dead 


grey 


next 


soft 


religion 


sheep 


starch 


thirst 


Uruguay 


artificial 


deaf 


grow 


nice 


some 


rent 


shirt 


station 


thread 


us 




automatic 


decrease 


guilty 


north 


sometimes 
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TABLE 4-continued 



DESCRIPTORS 



225 identifies the separate linguistic units and the expansion 
points. It then compares these with templates corresponding 
to the allowed expansion rules to validate the sentence, 
following which database lookup and translation take place. 
5^Ifttiesentence fails to conform to the rules of the invention, 
module 225 alerts the user via screen display 214. 

In accordance with either of these representation formats, 
plurals in English are noted by adding 7s" to the end of a 
singular noun (e.g., "nation/s"). In other languages, the most 
10 generic method of forming plurals is used; for example, in 
French, *7s" is added as in English, but in Italian, 7i w is 
added. Numbers are expressed numerically. 

Alternatively, analysis module 225 can be configured to 
process unformatted input sentences. To accomplish this, 
is module 225 looks up each input word (or, as appropriate, 
groups of words) in databases 235 and builds a representa- 
tion of the sentence in terms of the linguistic classes com- 
prising it — that is, replacing each unit with its linguistic 
class symbol. Module 225 then assesses whether the result - 
20 ing sequence of classes could have been generated in 
accordance with the allowed expansion rules, and if so, 
groups the linguistic units to facilitate lookup and transla- 
tion. The output is provided either in an unstructured format 
corresponding to the input or in one of the formats set forth 
_. 25 above. The latter form of output is preferred, since word 

An input buffer 240 receives from the user, via keyboard strings in one language rarely correspond sensibly to word 
210^ran input sentence that is preferably structured in/" _^ strings in another language produced solely by substitution; 
accordance with the invention and Jormatted-as described j t is generally easier to comprehend output in a form that 
below In this case, analysis module 225 initially examines isolates the linguistic units and highlights expansions, 
the input sentence for conformance to the structure. Follow- 30 ^ invention may incorporate additional features to 
ing this, module 225 processes .sin^e hnguisUc units of the > shn ]i{ operation. For example, as noted above, words 
input sentence in an iterative fashion addressing the data- >> hayin muUi , e are differentiated by ending riods; 

'bases to locate the entries corresponding to each linguistic > „ atll ,3i„ ~a t n • 

i • .l - -i ii l > naturally, the number of periods following a particular sense 

'unit in the given language, as well as the corresponding , t . . • u-. u • a a- , 

„ nt - :„- tK r # ,™t ,f , ' * i • j i ~e * - , of me word represents an arbitrary choice. Accordingly, an - 

entnes in the target language. Analysis module 225 trans- ' „.. v _ . . . K « e j- — - J* ^ 

i- t 4 . . . i ■ • • . .. . . , -35 ^additional database 235 can comprise a dictionary of words 
lates the sentence by replacing the input entries with the/ < k „ ■ „ u - , • 

ont • fr - ,„ , , * 4 - *u r i •* ^having multiple meanings, with the invention-recognized 

entnes from the target language, entering the translation into > V 7 f u ci j L . 



available 


deep 


hang 


not 


sour 


backward 


defective 


happen 


now 


south 


bad 


different 


happy 


often 


special 


bashful 


difficult 


hard 


okay 


stand 


beautiful 


dirty 


healthy 


old 


strong 


begin 


drop 


heavy 


open 


sweet 


black 


drown 


hungry 


our 


swim 


blind 


dry 


illegal 


permitted 


talk 


blond 


early 


important 


pink 


tall 


blue 


east 


increase 


play- 


thinks 


boil 


easy 


intelligent 


please 


there 


boring 


empty 


interesting 


poor 


thick 


born 


enough 


jealous 


portable 


thin 


brave 


expensive 


kiss 


possible 


think 


broken 


expire 


large 


previous 


tired 


brown 


extreme 


last 


quiet 


together 


burn 


far 


late 


red 


too-much 


capsize 


fast 


laugh 


rest 


transparent 


careful 


fat 


lazy 


rich 


travel 


change 


few 


left 


right 


ugly 


cheap 


first 


legal 


ripe 


upstairs 


urgent 


warm 


wet 


worry 


young 


wait 


weak 


white 


wrong 


your 


walk 


west 


why? 


yellow 





an output buffer-245 whose contents- appears on screen 
display 214. 

It must be understood that although the modules of main 
memory 204 have been described separately, this is for 
clarity of presentation only; so long as the system performs 
all necessary functions, it is immaterial how they are dis- 
tributed within the system and the programming architecture 
thereof. 

In order to facilitate convenient analysis by module 225, 
input sentences are preferably structured in a characteristic, 
easily processed format that facilitates both straightforward 
identification of individual linguistic units and simple veri- 
fication that the the sequence of units qualifies as a legiti- 
mate sentence in accordance with the expansion rules of the 
invention. In one approach ("portrait form"), each linguistic 
unit of a sentence appears in a separate line. If an expansion 
has been applied, an asterisk~(*)"is used to mark where the 
expansion occurred; that is, the * is used To connect basic 
r sentence structures together to form larger sentences. For 
example, drawing from the entries in FIGnr 

cat striped 

*hit* 

ball red 

represents the results of steps 132 and 134. 

Alternatively, the sentence can be expressed in an alge- 
braic ("landscape") format where expansions are identified 
by enclosing the expansion terms in parentheses: 

(cat striped) hit (ball red) 

In either case, the user's input is treated as a character 
string, and using standard string-analysis routines, module 



format of each sense of the word set next to the various 
definitions. User interface 230 interprets the user's clicking 
on one of the definitions as selection thereof, and enters the 
40 proper encoding of the word into input buffer 240. 

Similarly, because considerations of economy and speed 
of operation limit the overall desirable size of the databases, 
pnej)f_the„databases 235 can be set up as a thesaurus that - 
gives the closest invention-recognized linguistic unit to an; 
^unrecognized input^ord. In operation, when following an 
unsuccessful attempt by analysis module 225 to locate a 
word. in _the databases, module 225 can be programmed to 
consult the thesaurus database 235 and return a list of words ' / 
that do, in fact, appear in theflinguistic-unit databases. 
50 — Module 225 can also include certain utilities that recog- 
nize and correct (e.g., after approval by the user) frequently 
' made errors in sentence construction. For example, the 
_ ., present invention ordinarily indicates possession by a named 
person using the verb "to have"; thus, the sentence "Paul's 
55 computer is fast" is represented (in algebraic format) as 
J "paul have (computer fast)" or "(computer of paul) fast"; if 
the person is unnamed, the usual possessive pronouns may 
be used (e.g., "(computer my) fast"). Thus, module 225 can 
be configured to recognize constructions such as 
60 "PauTs"and return the appropriate construction in accor- 
dance with the invention. 

It will therefore be seen that the foregoing represents a 
convenient and fast approach to translation among multiple 
languages. The terms and expressions employed herein are 
65 used as terms of description and not of limitation, and there 
is no intention, in the use of such terms and expressions, of 
excluding any equivalents of the features shown and 
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described or portions thereof, but it is recognized that 
various modifications are possible within the scope of the 
invention claimed. For example, the various modules of the 
invention can be implemented on a general-purpose com- 
puter using appropriate software instructions, or as hardware 
circuits, or as mixed hardware-software combinations. 
What is claimed is: 

1. A method of translating information from a first lan- 
guage into a second language, the method comprising: 

a. providing reference and target sets of nominal, 
connector, and descriptor databases in reference and 
target languages, each nominal database comprising a 
series of nominal entries, each connector database 
comprising a series of connector entries each specify- 
ing a relationship between at least two nominal terms, 
and each descriptor database comprising a series of 
descriptor entries describing nominal entries, the 
entries of the reference set of databases corresponding 
to the entries of the target set of databases; 

b. generating a sentence in the reference language from a 
plurality of entries from the reference set of databases 
according to rules defining a constrained grammar; 

c. addressing the target set of databases with the reference 
entries to retrieve target entries corresponding thereto; 
and 

d. translating the sentence by directly substituting the 
reference entries with the target entries. 

2. The method of claim 1 further comprising the step of 
providing logical -connector databases in reference and tar- 
get languages, each logical-connector database comprising a 
series of entries establishing sets. 

3. The method of claim 2 wherein the sentence is gener- 
ated by selecting an item from the nominal database and 
expanding the sentence by applying at least one of the rules: 

a. to a nominal entry, add a descriptor entry from the 
descriptor database; 

b. to a nominal entry, add a connector entry from the 
connector database and another nominal entry from the 
nominal database; 

c. to a nominal entry, add a logical connector from the 
logical-connector database and another nominal entry 
from the nominal database; and 

d. to a descriptor entry, add a logical -connector entry from 
the logical-connector database and another descriptor 
entry from the descriptor database. 

4. The method of claim 2 wherein the logical-connector 
database comprises the entries and, or. 

5. The method of claim 1 wherein the nominal entries 
name a person, place, thing, activity or idea. 

6. The method of claim 5 wherein the nominal entries 
include the terms set forth in Table 2. 

7. The method of claim 1 wherein the connector entries 
show action, being or state of being. 

8. The method of claim 7 wherein the connector entries 
include the terms set forth in Table 3. 

9. The method of claim 1 wherein the descriptor entries 
describe a quality, quantity, state or type of a nominal entry. 

10. The method of claim 5 wherein the descriptor entries 
include the entries set forth in Table 4. 
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11. Apparatus for translating information from a first 
language into a second language, the apparatus comprising: 

a. first database means comprising a series of nominal 
entries in a reference and at least one target language; 

b. second database means comprising a series of connec- 
tor entries in a reference and at least one target 
language, the connector entries each specifying a rela- 
tionship between at least two nominal entries; 

c. third database means comprising a series of descriptor 
entries in a reference and at least one target language, 
the descriptor entries describing nominal entries; 

d. means for accepting an input in the reference language, 
the input comprising entries from the database means 
organized in accordance with rules defining a con- 
strained grammar; and 

e. analysis means for (i) addressing the target set of 
databases with the input to retrieve target entries cor- 
responding thereto and (ii) translating the sentence by 
directly substituting the target entries for the input. 

12. The apparatus of claim 11 further comprising fourth 
database means comprising a series of entries establishing 
sets. 

13. The apparatus of claim 12 wherein the analysis means 
is configured to ensure that the input conforms to a sentence 
constructed in accordance with expansion rules comprising: 

a. to a nominal entry, add a descriptor entry from the third 
database means; 

b. to a nominal entry, add a connector entry from the 
second database means and another nominal entry from 
the first database means; 

c. to a nominal entry, add an entry from the fourth 
database means and another nominal entry from the 
first database; and 

d. to a descriptor entry, add an entry from the fourth 
database and another descriptor entry from the third 
database. 

14. The apparatus of claim 12 wherein the fourth database 
means comprises the entries and, or. 

15. The apparatus of claim 11 wherein the nominal entries 
name a person, place, thing, activity or idea. 

16. The apparatus of claim 15 wherein the first database 
means includes the terms set forth in Table 2. 

17. The apparatus of claim 11 wherein the connector 
entries show action, being or state of being. 

18. The apparatus of claim 17 wherein the second data- 
base means includes the terms set forth in Table 3. 

19. The apparatus of claim 11 wherein the descriptor 
entries describe a quality, quantity, state or type of a nominal 
entry. 

20. The apparatus of claim 19 wherein the third database 
means includes the terms set forth in Table 4. 

21. The apparatus of claim 11 further comprising a 
thesaurus for identifying, for an input term not matching any 
of the database entries, a database entry closest in meaning 
to the input term. 

22. The apparatus of claim 11 further comprising means 
for displaying the translation. 



05/28/2004, EAST version: 1.4.1 



